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SEQUENCE LISTING 



<110> Roche Vitamins AG 

<120> ACC gene 

<130> NDR5217 

<160> 10 

<170> Patentln version 3.1 

<210> 1 

<211> 10561 

<212> DNA 

<213> Phaffia rhodozyma 
<220> 

<221> 5*UTR 

<222> (1221) . . (1222) 

<223> 

<220> 

<221> polyA_site 

<222> (9813) . . (9814) 
<223> 

<220> 

<221> exon 

<222> (1446) . . (1482) 

<223> 

<220> 

<221> exon 
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<222> (9231). . (9530) 
<223> 

<220> 

<221> exon 
<222> (7296) . . (9160) 
<223> 

<220> 

<221> exon 
<222> (7048) . . (7227) 
<223> 

<220> 

<221> exon 
<222> (6899) . . (6976) 
<223> 

<220> 

<221> exon 
<222> (5871). .(6832) 
<223> 

<220> 

<221> exon 
<222> (5674) . . (5805) 
<223> 

<220> 

<221> exon 
<222> (5456) . . (5608) 
<223> 

<220> 
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<221> exon 

<222> (4984) .. (5384) 

<223> 

<220> 

<221> exon 

<222> (4096) . . (4911) 

<223> 

<220> 

<221> exon 

<222> (3828) . . (4026) 

<223> 

<220> 

<221> exon 

<222> (3075). .(3443) 

<223> 

<220> 

<221> exon 

<222> (3518). .(3552) 

<223> 

<220> 

<221> exon 

<222> (1676). .(1758) 

<223> 

<220> 

<221> exon 

<222> (1833) . . (1957) 

<223> 
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<220> 

<221> exon 
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<222> (2031) . . (2171) 
<223> 



<220> 

<221> exon 

<222> (2244) . . (2641) 

<223> 



<220> 

<221> exon 

<222> (2746) . . (2991) 

<223> 



<220> 

<221> exon 

<222> (3626). .(3750) 

<223> 



<220> 

<221> Intron 

<222> (1483) . . (1675) 

<223> 



<220> 

<221> intron 

<222> (4912) . . (4983) 

<223> 



<220> 

<221> intron 

<222> (3751).. (3827) 

<223> 
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<220> 

<221> Intron 

<222> (5385).. (5455) 

<223> 

<220> 

<221> Intron 

<222> C6977) . . (7047) 

<223> 

<220> 

<221> Intron 

<222> (3553).. (3625) 

<223> 

<220> 

<221> Intron 

<222> (5806). . (5870) 

<223> 

<220> 

<221> intron 

<222> (9161) . . (9230) 

<223> 

<220> 

<221> Intron 

<222> (3444).. (3517) 

<223> 

<220> 

<221> intron 

<222> (2992) . . (3074) 
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■ <223> 
<220> 

<221> Intron 

<222> (7228).. (7295) 

<223> 

<220> 

<221> intron 

<222> (6833) . . (6898) 

<223> 

<220> 

<221> Intron 

<222> (5609) .. (5673) 

<223> 

<220> 

<221> intron 

<222> (4027) . . (4095) 

<223> 

<220> 

<221> Intron 

<222> (2642) .. (2745) 

<223> 

<220> 

<221> intron 

<222> (2172) . . (2243) 

<223> 

<220> 

<221> intron 
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<220> 

<221> intron 

<222> (1759). .(1832) 

<223> 



<400> 1 














caacagacag 


acaaaggaac 


ttacgtgtac 


atactggtct 


ttccaatgtc 


geggegtega 


60 


gattaactag 


aacaatactt 


gacaatcgaa 


tctcttattc 


tgccctagtt 


gaaggegtet 


120 


gttcaaattg 


atcaagatct 


tccaatcatt 


gacatccagg 


tattegcatt 


cgactctgct 


180 


cgtatgtact 


gttccgattt 


tcttatggcc 


accagatttc 


aactctgata 


tacattggtt 


240 


caccctgtct 


ttgtctcttt 


gcctttcgtt 


ccatctagcg 


ctgttcaacg 


gatcactcag 


300 


tcggcttgac 


tcaactccct 


ctggaacgtg 


tgecttatet 


caggttctga 


tttctcctca 


360 


gccagtatgc 


gcacaaagca 


gcgatcgtga 


etttttgetc 


cataagacct 


etcagegggg 


420 


aatatatgac 


actcatacat 


cgatagctcg 


tatgttttct 


ttgatcactt 


cctaaaatgt 


480 


aacggcaact 


gacattcaac 


atgatgcgct 


ttcatagatc 


aactacttcc 


gactacgatg 


540 


accgttcttc 


tatacagccc 


agtcagctcg 


tcgacctcac 


ataaagtgac 


tgagaccgcg 


600 


atctcgaaca 


tcttattcct 


tccaccgtta 


gctgagaagt 


ggattacacc 


atcaatagaa 


660 


tcatctaccc 


cgttcttgcc 


tggactaatg 


egtcaggage 


tcttggataa 


aggagaaata 


720 


gctgagcaga 


ccatcacctt 


ggatgatgtc 


cgtctgtggc 


tgaactcegg 


aggtcgagtg 


780 


gcgtgctgca 


acgcacttcg 


aggaatttgg 


gaagtgaacc 


tcgtttggag 


tgataaatga 


840 


gattacgaaa 


gtctgttcga 


aacatccatg 


cttcatgata 


accgataacg 


cttaaatctt 


900 


gagagtgcgc 


acatcgatcg 


ccttttatat 


atggggttgg 


ggaaacataa 


agtgttcata 


960 


gactattgtt 


catatatctt 


aaagtacaaa 


gaegcatcta 


accctaagcc 


tgaatgattg 


1020 


gcaaaatcct 


agtaagaccg 


tgaaattccg 


aagaataege 


agttcattaa 


taaagatata 


1080 


gcttaggtaa 


gcagcggttg 


ctcccccaac 


caacctcatc 


cgaaattccc 


cagggggttg 


1140 


agattctcaa 


ggctttgaat 


ccccatcccg 


tcaagttggt 


cttaaaccct 


tcatctctac 


1200 


ttgttacttc 


ttttcttctt 


gacctccttc 


ccccactccc 


tcctattctc 


tgaacgaact 


1260 


cgcctccctg 


tccatctact 


cttcttcggt 


tttcttttgg 


gtttttactt 


ttctcgttcc 


1320 


tcctccatct 


ttccatctct 


tttcgtatct 


gtgggtaact 


ttgcatccaa 


gggccctcac 


1380 


acataaccct 


atatccatct 


tcctccattc 


acacacatct 


gtactcaacc 


aacaaagctc 


1440 


acaag atg gtt gtc gat 


cac gag age gta agg cat ttc ate g 


1482 


Met val Val Asp 
1 


His Glu ser val Arg His Phe He 
5 10 
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gtaagcgttc ttgttctttt ccttgtctgg ctccctgcat tttcttaaac gatctaggaa 

gagagggaaa ttacatctgg tcaattttcc gcgctctttt ccttggggac aaaagaatgc 

ctttctgtga tcggagatcg gttgctgatc tcttttgtct tgttcttttt gctctttccc 

tcccctttac cag gt gga aac gca ctt gag aac gcc cct ccg tea age 
Gly Gly Asn Ala Leu Glu Asn Ala Pro Pro Ser Ser 
15 20 

gtc acc gat ttc gtt aga agt caa gat ggt cac acg gtc ate acc aaa 
Val Thr Asp Phe val Arg Ser Gin Asp Gly His Thr val lie Thr Lys 
25 30 35 40 

gtcagtaatt ttcatttttt ccttcacgta gcctcagggc caaggagcta aattgettet 

gtatcatttc tcag gtc etc att gcc aac aac gga ate get get gta aaa 
Val Leu He Ala Asn Asn Gly lie Ala Ala Val Lys 



45 



50 



gag ate cga tea gtt cgt aaa tgg get tac gag acg ttt gga gat gag 
Glu He Arg Ser Val Arg Lys Trp Ala Tyr Glu Thr Phe Gly Asp Glu 
55 60 65 

cga gcc ate gaa ttt acg gta atg gcc act cca gaa gat tt 
Arg Ala lie Glu Phe Thr Val Met Ala Thr Pro Glu Asp Leu 
70 75 80 

gttegtacca atcacataag ctttccttga gtcagggaca tcctctaatt aattcaactt 

gagegecata cag g aag gtg aac tgc gac tat att cga atg get gat cga 
Lys val Asn cys Asp Tyr lie Arg Met Ala Asp Arg 
85 90 

gtc gtc gaa gtt cct gga gga act aac aac aac aat cac tct aac gtc 
val val Glu val Pro Gly Gly Thr Asn Asn Asn Asn His Ser Asn val 



95 



100 



105 



110 



gac etc ate gtt gac att gcc gag cga ttc aat ata cat get gtt tgg 
Asp Leu He val Asp lie Ala Glu Arg Phe Asn lie His Ala val Trp 
115 120 125 

get gga tg gtaagtaaaa taggacctta acatgttgga agaagagtgt 
Ala Gly Trp 



1542 
1602 
1662 
1710 

1758 

1818 
1868 

1916 

1957 

2017 
2067 

2115 

2163 

2211 



ccacttaaac gegctttett tccatccgac ag 



aga ctt 
Arg Leu 



ggt cct 
Gly Pro 



acc ate 
Thr lie 
170 

tea ggc 
Ser Gly 
185 



ccc gag 
Pro Glu 
140 

ccc gga 
Pro Gly 
155 

gtt gcc 
val Ala 



ate act 
He Thr 



tct etc 
Ser Leu 



tec get 
Ser Ala 



cag tct 
Gin Ser 



gat aca 
Asp Thr 
190 



gcc gcc tea 
Ala Ala ser 
145 

atg cga tec 
Met Arg ser 
160 

gcc cag gtq 
Ala Gin val 
175 

gag etc age 
Glu Leu ser 



g ggt cac get teg gaa aac ccc 
Gly His Ala ser Glu Asn Pro 
130 135 

aag aac aag ate gtc ttc att 

Lys Asn Lys He Val Phe lie 
150 

ctt gga gac aag att tct teg 

Leu Gly Asp Lys lie ser ser 
165 

ccg tgt atg gcc tgg tct gga 

Pro cys Met Ala Trp ser Gly 
180 



cct cag ggc ttc gtg act gtq 
Pro Gin Gly Phe val Thr Val 
195 200 



ccc gat ggg cca tat cag get get tgt 



gta aag acg gtg gag gat ggt 
8 



2265 



2313 



2361 



2409 



2457 



2505 
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Pro Asp Gly Pro Tyr Gin Ala Ala Cys val Lys Thr val Glu Asp Gly 

205 210 215 

ttg gtq cga gcc gag aag ate got ttg cca gtt atg ate aag gec tct 

Leu val Arg Ala Glu Lys lie Gly Leu Pro val Met lie Lys Ala ser 



220 



225 



230 



gag gga gqa gga gga aag gqt ate cga atg gtt cac age atg gac aca 
Glu Gly Gly Gly Gly Lys Gly lie Arg Met val His ser Met Asp Thr 
235 240 245 



2553 



2601 



ttc aag aac tec tac aac tec gtc get tec gag gtq cca g gtaagttcac 
Phe Lys Asn ser Tyr Asn ser val Ala ser Glu val Pro 
250 255 260 

tctgtttgac tggagatttg agcacaatct ctaccatggg agttcaagaa ggaataccca 

ctcatgaatt gaegactgeg ttcttgacct ctag ga tct ccg att ttc ate atg 

Gly Ser Pro lie Phe lie Met 
265 

gcc ttg get gga tct get cga cat ttg gag gtc cag etc ctt get gat 
Ala Leu Ala Gly Ser Ala Arg His Leu Glu val Gin Leu Leu Ala Asp 
270 275 280 

cag tac gga aac get ate tct ttg ttc got cga gat tgc tct gtt cag 
Gin Tyr Gly Asn Ala lie Ser Leu Phe Gly Arg Asp Cys Ser Val Gin 



285 



290 



295 



300 



cga cga cat cag aag ate att gag gag get ccc gtc acg ate get cgt 

Arg Arg His Gin Lys lie lie Glu Glu Ala Pro val Thr lie Ala Arg 
305 310 315 

cca gag aga ttc gaa gag atg gag aag get get gtc agg ttg gcc aag 

Pro Glu Arg Phe Glu Glu Met Glu Lys Ala Ala val Arg Leu Ala Lys 
320 325 330 

tta gta gcia tat gtt agt gcc ggt ace gtc gaa t gtaaggaaca 

335 



Leu Val Gly Tyr val Ser Ala Gly Thr val Glu 

340 



aacagctacc tctcattctg ttttttcgag atagtcaact tacatcactt ttcttttgee 

ggattttctt tag ac etc tac tct cac gcc gac gac tea ttc ttc ttc 
Tyr Leu Tyr Ser His Ala Asp Asp Ser Phe Phe Phe 
345 350 355 

etc gaa etc aac cct cga ctt caa gtc gag cac cct act ace gag atg 
Leu Glu Leu Asn Pro Arg Leu Gin val Glu His Pro Thr Thr Glu Met 
360 365 370 



gtc teg gqt gtc aac ctt ccc get get cag ctt cag att get atg gqt 
Val Ser Gly Val Asn Leu Pro Ala Ala Gin Leu Gin lie Ala Met Gly 
375 380 385 



ate cct ctt tct cga att egg gat att cga gtc etc tac gqt etc gat 
lie Pro Leu Ser Arg lie Arg Asp lie Arg val Leu Tyr Gly Leu Asp 
390 395 ~ 400 

ccc cac act gtt tec gag ate gac ttc gac age age aga gcg gag tct 
Pro His Thr val Ser Glu lie Asp Phe Asp Ser Ser Arg Ala Glu Ser 
405 410 415 

gtc cag act cag agg aag cct agg ccc aag gqt cac gtc att gcc tgt 
Val Gin Thr Gin Arg Lys Pro Arg Pro Lys Gly His Val lie Ala Cys 
420 425 ~ 430 435 

cga ate acg agt gaa aac ccc gat gag ggg ttc aag ccg tct gcc gga 

9 



2651 

2711 
2765 

2813 

2861 

2909 

2957 

3001 

3061 
3109 

3157 

3205 

3253 

3301 

3349 

3397 



WO 2004/029232 



PCT/EP2003/010683 



sequencel i sti ng-21416 . ST25 
Arg lie Thr ser Glu Asn Pro Asp Glu Gly Phe Lys Pro Ser Ala Gly 
440 445 450 

gat ate caa gag ttg aac ttc aga agt aat act aac gtc tgg gga t 
Asp lie Gin Glu Leu Asn Phe Arg ser Asn Thr Asn Val Trp Gly 
455 460 465 

gtgagtacag aggcttctca aagattctta tgtggaacaa atctctgact cttaaattgt 

gtttgacttt caag ac ttc tct gtt gga get act gga gga att cat agt 
Tyr Phe Ser val Gly Ala Thr Gly Gly lie His Ser 

475 



470 



gtaagtttct tcgccaacaa tataatcaca ctagatccct atctaatctg aactggctta 

tctcttgtta tag ttc gec gat tct caa ttc ggt cac gtg ttt get tat 
Phe Ala Asp Ser Gin Phe Gly His val Phe Ala Tyr 



480 



485 



490 



ggc tec gac cga acg act gec aga aag aat atg gtt ate gee ttg aaa 
Gly Ser Asp Arg Thr Thr Ala Arg Lys Asn Met Val lie Ala Leu Lys 
495 500 505 

gag ctt tec att cga gga gac ttc cga ace act gtc gag ta 
Glu Leu ser lie Arg Gly Asp Phe Arg Thr Thr Val Glu Tyr 
510 515 

gtgegtatag cctggtacat ctcctttcaa tcacttacga tgaactgacc gatctgtctc 

gatcacgttt aatctag t ctt ate act ctt ctt gag acg age gat ttc gag 

Leu lie Thr Leu Leu Glu Thr Ser Asp Phe Glu 
525 530 

cag aac gec att acc acc get tgg ttg gat ggg ttg ate act aac aag 
Gin Asn Ala lie Thr Thr Ala Trp Leu Asp Gly Leu lie Thr Asn Lys 
535 540 545 

ctt aca tct gag agg cct gat cca tea ctg gee gtt att tgt ggt gca 
Leu Thr Ser Glu Arg Pro Asp Pro ser Leu Ala Val lie cys Gly Ala 
550 " 555 560 

att gtg aaa get cac gtq get tct gag aac tgt tgg gee gaa tac cga 
lie Val Lys Ala His val Ala Ser Glu Asn Cys Trp Ala Glu Tyr Arg 



565 



570 



575 



cga gta ttg gac aag gga cag gtaagctctg tttctcatga agtttttgac 
Arg Val Leu Asp Lys Gly Gin 



580 



58 



tgaggcactc accactccgt acatgtttcc tgtttttag gtt ccc tec aag gac 

Val Pro Ser Lys Asp 
590 



act etc aag aca gtq ttc act ctt gat ttc ate tat gag ggt gtt egg 
Thr Leu Lys Thr val Phe Thr Leu Asp Phe lie Tyr Glu Gly val Arg 
595 600 605 



tac aat ttc acc get get cga gee tec etc aac act tac cga ttg tat 

Tyr Asn Phe Thr Ala Ala Arg Ala Ser Leu Asn Thr Tyr Arg Leu Tyr 

610 615 620 

eta aac gga gga aag acc gtg gtg tec ate cga cct ttg gee gat ggt 

Leu Asn Gly Gly Lys Thr val val Ser lie Arg Pro Leu Ala Asp Gly 

625 630 635 

gga atg etc gtt ctt etc gat ggc cga tec cac act etc tac tgg agg 

Gly Met Leu val Leu Leu Asp Gly Arg ser His Thr Leu Tyr Trp Arg 



640 



645 



650 
10 



655 



3443 

3503 
3552 

3612 
3661 

3709 

3750 

3810 
3861 

3909 

3957 

4005 

4056 

4110 

4158 

4206 

4254 

4302 
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gag gaa gtc ggt acc etc cga att cag gta gac gca aag act tgc ctg 

Glu Glu val Gly Thr Leu Arg lie Gin val Asp Ala Lys Thr cys Leu 

660 * 665 670 

att gag cag gag aac gac ccc act cag etc cga tea ccc teg cct gga 

lie Glu Gin Glu Asn Asp Pro Thr Gin Leu Arg Ser Pro ser pro Gly 

675 680 " 685 

aag ate ate egg ttt ttg gtc gaa age gga gat cac ate tec tec gga 

Lys lie lie Arg Phe Leu val Glu ser Gly Asp His lie ser ser Gly 

690 695 700 



4350 



4398 



4446 



gat ate tat get gag gtt gag gtc atg aag atg ate ttg ccc ttg att 
Asp lie Tyr Ala Glu val Glu val Met Lys Met lie Leu Pro Leu lie 
705 710 715 



gee cag gag tec gqt cac gtt cag ttt gtc aag caa gee ggt gtg acc 
Ala Gin Glu ser Gly His val Gin Phe Val Lys Gin Ala Gly val Thr 
720 725 730 



r 
735 



val Asp Pro Gly Ala lie lie Gly lie Leu Ser Leu Asp Asp Pro Thr 
740 * 745 750 



gtc gat cct gga gcg.att att ggg ate ttg agt ctt gat gac cct acg 

- ify Ala lie lie Gl> ~" 

cga gtg aag aag gcg aag ccc ttc gag ggt etc ctg cct gtg act ggt 

Arg val Lys Lys Ala Lys Pro Phe Glu Gly Leu Leu Pro Val Thr Gly 
755 760 765 

etc cct aac ctg ccc gqt aac aga cct cac cag egg eta cag ttc cag 

Leu Pro Asn Leu Pro Gly Asn Arg Pro His Gin Arg Leu Gin Phe Gin 
770 775 780 

ctt gag teg ata tac teg gtc ttg gat gga tac gag agt gac tec act 

Leu Glu Ser lie Tyr ser val Leu Asp Gly Tyr Glu Ser Asp Ser Thr 
785 790 795 

gca aca ate etc cga tea ttc tct gaa aac ctt tat gat cct gat ctt 

Ala Thr lie Leu Arg Ser Phe Ser Glu Asn Leu Tyr Asp Pro Asp Leu 
800 805 810 815 

get ttc gga gag get tta tec ate att tec gtc ctt tct ggg aga atg 

Ala Phe Gly Glu Ala Leu Ser lie lie Ser val Leu Ser Gly Arg Met 
820 825 830 

cct gec gat ctt gag gag age att cga gag gtc ate age gaa get cag 

Pro Ala Asp Leu Glu Glu Ser lie Arg Glu Val lie Ser Glu Ala Gin 
" ~ 0 



835 



84 



845 



teg aag cct cac gee gag ttc cct gga tea aag gtgtgtagtt gategcagag 
Ser Lys Pro His Ala Glu Phe Pro Gly Ser Lys 
850 855 



4494 



4542 



4590 



4638 



4686 



4734 



4782 



4830 



4878 



4931 



ttatgactgt atacatcgac cagaagctta cccatctctt tcgtgtgcac ag ate etc 

lie Leu 
860 

aaa gtc gtc gag egg tac ate gat aat ttg cga cct cag gag agg get 
Lys val Val Glu Arg Tyr lie Asp Asn Leu Arg Pro Gin Glu Arg Ala 
865 870 875 

atg gtc cga act cag ate gaa ccc ate gtt ggt att get gag aag aac 
Met Val Arg Thr Gin lie Glu pro He Val Gly lie Ala Glu Lys Asn 
880 885 890 

gtt ggc ggt cct aag ggt tac gee tct tac gtc tta get acc ate ctt 
Val Gly Gly Pro Lys Gly Tyr Ala ser Tyr val Leu Ala Thr lie Leu 



895 



900 



905 



4989 



5037 



5085 



5133 
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caa aag ttc ttg gcc gtt gag gcc gtt ttt get act ggt agt gaa gag 
Gin Lys Phe Leu Ala val Glu Ala Val Phe Ala Thr Gly Ser Glu Glu 
910 915 920 



gtc ctt ggt etc gtc ctg get cac teg cgt etc age get cga tec aag 
val Leu Gly Leu val Leu Ala His Ser Arg Leu Ser Ala Arg ser Lys 
945 950 955 



5181 



gcc att gtt etc caa ctt cga gat gaa aac cga gaa tct ttg aac gac 5229 
Ala lie val Leu Gin Leu Arg Asp Glu Asn Arg Glu Ser Leu Asn Asp 
925 930 935 940 



5277 



ctt gtt etc tec gtc ttt gat ctg ate aag tct atg cag etc etc aac 5325 
Leu val Leu Ser Val Phe Asp Leu lie Lys Ser Met Gin Leu Leu Asn 
960 965 970 

aac act gag ggt tct ttc ctt cat aag act atg aaa gcg ctt gcc gac 5373 
Asn Thr Glu Gly Ser Phe Leu His Lys Thr Met Lys Ala Leu Ala Asp 
975 980 985 

atg ccc acc aa gtaggtttcc tcttgtagtt tacaaactat tgttgcgatg 5424 
Met Pro Thr Lys 
990 

tgttgacaaa gactctgttt ccgatctata g g get cct ttg gcc age aag gtg 5477 

Ala Pro Leu Ala Ser Lys val 
995 

tct ttg aag get egg gaa att ctt ate tct tgc tct ctt ccc tct 5522 
Ser Leu Lys Ala Arg Glu lie Leu lie ser cys ser Leu pro ser 
1000 1005 1010 

tac gag gag agg ttg ttc cag atg gaa aag ate ctt aac tct tct 5567 
Tyr Glu Glu Arg Leu Phe Gin Met Glu Lys lie Leu Asn ser Ser 
1015 1020 1025 

gtc acc act tct tac tac gga gag act gga ggt gga cac ag 5608 
Val Thr Thr ser Tyr Tyr Gly Glu Thr Gly Gly Gly His Arg 
1030 1035 1040 

gtttgtcctc tcccatgtgt ttctagttca tagctctctg ctgactctga tccgattttc 5668 

aacag a aac cct teg gtt gat gtt ctg act gag ate tea aac tct 5713 
Asn Pro Ser val Asp val Leu Thr Glu lie Ser Asn ser 
1045 1050 1055 

cga ttc acc gtc tac gat gtc ctg tec tec ttc ttc aag cac gat 5758 
Arg Phe Thr Val Tyr Asp Val Leu Ser ser Phe Phe Lys His Asp 
1060 1065 1070 

gat cct tgg att gtt ctt get agt ttg acc gtc tac gtt ctt cga 5803 
Asp Pro Trp lie Val Leu Ala ser Leu Thr Val Tyr val Leu Arg 
1075 1080 1085 

gc gtaagtgatc gttcttctcc tcttgcccaa acaatgactg acagttctat 5855 
Ala 

ctattccatc tgcag t tac cga gag tac agt att ctt gat atg caa cat 5904 

Tyr Arg Glu Tyr ser lie Leu Asp Met Gin His 
1090 1095 

gag caa ggt cag gat ggc get get gga gtc ate act tgg cga ttc 5949 
Glu Gin Gly Gin Asp Gly Ala Ala Gly Val lie Thr Trp Arg Phe 
1100 1105 1110 

aag etc aac cag ccc ate get gag tct tct act ccc cga gtt gac 5994 

12 
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Lys Leu Asn 
1115 



sequence! i sti ng-21416 . ST2 5 
Gin Pro He Ala Glu Ser Ser Thr Pro Arg Val Asp 
1120 1125 



teg aat cga 
Ser Asn Arg 
1130 



gac gtt tac cga gtc gqt teg ctt tct gat 
Asp Val Tyr Arg val Gly Ser Leu Ser 



1135 



1140 



Asp 



ttg acc 
Leu Thr 



6039 



tac aag ate 
Tyr Lys lie 
1145 

atg acg age 
Met Thr ser 
1160 



aag eag agt 
Lys Gin ser 



ttc aac aac 
Phe Asn Asn 



cag acc gag ccc etc cga get 
Gin Thr Glu Pro Leu Arg Ala GTy Val 
1150 1155 



g^t gtc 



ttg _ _ 

Leu Lys Glu Val Gin Asp Gly 
1165 1170 



aag gag gtt cag gac gga 



etc ttg 
Leu Leu 



6084 



6129 



aat gtt ctg 
Asn val Leu 
1175 



tct ttc ttc cct get tac cat cat caa gat ttc act 
Ser Phe Phe Pro Ala Tyr His His Gin Asp Phe Thr 
1180 1185 



6174 



caa cga cat 
Gin Arg His 
1190 



gqt cag gac 
Gly Gin Asp 



agt gee atg ccc aac gtt etc aac att 
ser Ala Met Pro Asn val Leu Asn He 
1195 1200 



6219 



get ate egg 
Ala lie Arg 
1205 



get ttc gag gag aag gac gac atg tct gat ctt gat 
Ala Phe Glu Glu Lys Asp Asp Met ser Asp Leu Asp 
1210 1215 



6264 



tgg gee aag 
Trp Ala Lys 
1220 



agt gtt gag teg ctg gta atg cag atg tct gee gag 
Ser val Glu Ser Leu val Met Gin Met Ser Ala Glu 
1225 1230 



6309 



ate cag aag 
lie Gin Lys 
1235 



aag gga att 
Ty il 



cga cga gtt acc ttc ttg gtt tgc cga 
Lys Gly lie Arg Arg Val Thr Phe Leu Val Cys Arg 
1240 ~ 1245 



6354 



aag gqc gtt 
Lys Gly Val 
1250 



tac ccc tec tac ttc acc ttc aga caa gag 
Tyr Pro ser Tyr Phe Thr Phe Arg Gin Glu GTy Ala 
1255 1260 



gqt gee 
slv 



6399 



cag gqc ccc 
Gin Gly Pro 
1265 



tgg aga gag 
Trp Arg Glu 



gag gag aag att cga aac ate gag cct 
Glu Glu Lys lie Arg Asn lie Glu Pro 
1270 1275 



6444 



get eta gee 
Ala Leu Ala 
1280 



agt cag ctt gag etc aac cga etc teg aat ttc aag 
ser Gin Leu Glu Leu Asn Arg Leu ser Asn Phe Lys 
1285 1290 



6489 



gtc acc cct 
val Thr Pro 
1295 

gga gtq gqt 
Gly val Gly 
1310 



ate ttc gta 
lie Phe val 



aag gag aac 
Lys Glu Asn 



gac aac aga cag ate cac ate tac aag 6534 
Asp Asn Arg Gin lie His lie Tyr Lys 
1300 " 1305 

tct tec gat gtt cga ttc ttt ate egg 6579 
Ser ser Asp Val Arg Phe Phe lie Arg 
1315 1320 



get ttg gtt 
Ala Leu val 
1325 



cga cct gga 
Arg Pro Gly 



egg 
Arg 
1330 



gtc cag 
val Gin 



gga teg 
Gly ser 



atg aag 
Met Lys 
1335 



get gee 
Ala Ala 



6624 



gag tat etc 
Glu Tyr Leu 
1340 



ate tec gag 
lie Ser Glu 



tgc 
Cys 
1345 



gat cga 
Asp Arg 



ctg etc 
Leu Leu 



act gat 
Thr Asp 
1350 



ate ctg 
lie Leu 



6669 



gac gee ttg 
Asp Ala Leu 
1355 



gag gtt gtt 
Glu val val 



gga 
Gly 
1360 



gee gag 
Ala Glu 



act cga 
Thr Arg 



aac gee 
Asn Ala 
1365 



gat tgc 
Asp Cys 



6714 



aac cat gtt gga att aac ttc ate tat 



aac gtt 
13 



ctt gtc gac ttc 
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sequence! i sti ng-21416 . ST25 
Asn His Val Gly lie Asn Phe lie Tyr Asn val Leu val Asp Phe 
1370 1375 ^ 1380 

gac gac gtc cag gag gcc ctt gcc ggg ttc att gag agg cac gga 
Asp Asp val Gin Glu Ala Leu Ala Gly Phe lie Glu Arg His Gly 
1385 1390 1395 

aag agg ctt tgg cga ctt cga gtg acc g gtaagtgttc tctcggcatt 
Lys Arg Leu Trp Arg Leu Arg Val Thr 
1400 1405 

gaattcagca atgagctgtg actaacgggt ttcttcggta tattag ct tct gaa 

Ala Ser Glu 
1410 

ate cga atg gtt ctt gag gac gac gag ggt aac gtc acc ccc ate 
lie Arg Met Val Leu Glu Asp Asp Glu Gly Asn val Thr Pro lie 
1415 1420 1425 

cga tgc tgc att gag aac gtt tct g gtaagcagtc caaaataact 
Arg cys cys lie Glu Asn Val Ser 
1430 

gataatccta ttcagtctag acattgtaac tgatgcattt ctegttctta g gt ttc 

Gly Phe 
1435 

gtc gtg aag tac cac gcc tac cag gag gtt gag acc gag aag ggt 
val val Lys Tyr His Ala Tyr Gin Glu val Glu Thr Glu Lys Gly 



1440 



1445 



1450 



act acc ate ttg aag tea ate gga gac ctt gga cct ctt cac ctt 
Thr Thr lie Leu Lys ser lie Gly Asp Leu Gly pro Leu His Leu 
1455 1460 1465 

cag cct gtc aac cat get tac cag acc aag aac agt ctt cag ccc 
Gin pro Val Asn His Ala Tyr Gin Thr Lys Asn ser Leu Gin Pro 
1470 1475 1480 

cga cga tac cag get cac ttg gtt gga acg act tac gtc t 
Arg Arg Tyr Gin Ala His Leu val Gly Thr Thr Tyr Val 
1485 1490 

gttagtcaca tttcatgetc tggttttctg accgtcactg . gttattgacg ttctgtttgg 

cgtcacag ac gac tac ccc gat etc ttc gtt cag agt ttg cgc aag 
Tyr Asp Tyr Pro Asp Leu Phe Val Gin ser Leu Arg Lys 
1495 1500 1505 

gtt. tgg get gag get get get aag att cct cac etc egg gtg cct 
val Trp Ala Glu Ala Ala Ala Lys lie Pro His Leu Arg val Pro 
1510 1515 1520 

age gag cct ctt acc get acc gag ttg gtt etc gat gag aac aac 
ser Glu Pro Leu Thr Ala Thr Glu Leu val Leu Asp Glu Asn Asn 
1525 1530 1535 

gag ctt cag gag gtc gag cga cct ccg ggt tec aac teg tgt ggt 
Glu Leu Gin Glu Val Glu Arg Pro Pro Gly Ser Asn ser Cys Gly 
1540 " 1545 1550 

atg gtc gcc tgg ate ttc act atg etc act ccc gag tat ccc aag 
Met Val Ala Trp^ lie Phe Thr Met Leu Thr Pro Glu Tyr Pro Lys 



1560 



1565 



ggt cga cga gta gtt gcc att gcc aac gat ate acc ttc aag att 
Gly Arg Arg Val val Ala lie Ala Asn Asp lie Thr Phe Lys lie 
1570 1575 1580 

14 



6804 

6852 

6906 

6951 

6996 

7052 

7097 

7142 

7187 

7227 

7287 
7333 

7378 

7423 

7468 

7513 

7558 
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gga tec ttt ggt cct aag gaa gac gat tac ttc ttc aag get act 

Gly ser Phe Gly Pro Lys Glu Asp Asp Tyr Phe Phe Lys Ala Thr 
1585 1590 1595 

gaa att gec aag aag ctg ggc ctt cct cga att tac etc tct gec 

Glu lie Ala Lys Lys Leu Gly Leu Pro Arg lie Tyr Leu Ser Ala 
1600 1605 ' 1610 

aac agt gga get aga etc ggt ate gcg gag gag etc ttg cac ate 

Asn ser Gly Ala Arg Leu Gly He Ala Glu Glu Leu Leu His lie 
1615 " 1620 1625 

ttc aag gcg gee ttc gtt gac ccc gca aag cct tec atg ggt att 

Phe Lys Ala Ala Phe Val Asp Pro Ala Lys Pro Ser Met Gly He 
1630 1635 1640 

aag tat eta tac ttg ace cct gaa act tta tec act ctt gec aag 

Lys Tyr Leu Tyr Leu Thr Pro Glu Thr Leu ser Thr Leu Ala Lys 
1645 1650 1655 

aag gga tec age gtc ace act gag gag ate gag gat gac ggc gag 

Lys Gly ser Ser Val Thr Thr Glu Glu lie Glu Asp Asp Gly Glu 
1660 1665 1670 

cga cga cac aag ate acc gec ate ate ggt ctt gca gag ggt ttg 

Arg Arg His Lys lie Thr Ala lie lie Gly Leu Ala Glu Gly Leu 
1675 1680 1685 

gga gtt gag tct ctt cga gga tec ggt ctt att get gga gec acc 

Gly val Glu ser Leu Arg Gly ser Gly Leu lie Ala Gly Ala Thr 
1690 1695 1700 

act cga get tac gag gag gga ate ttc acc ate tct etc gtt act 

Thr Arg Ala Tyr Glu Glu Gly lie Phe Thr lie ser Leu val Thr 



gee cga teg gtc ggt ate gga get tac ttg gtt cga ttg ggt cag 

Ala Arg Ser val Gly lie Gly Ala Tyr Leu val Arg Leu Gly Gin 
1720 1725 1730 

cga get att cag gtt gaa ggc aac cct atg ate ctt act gga get 

Arg Ala lie Gin Val Glu Gly Asn Pro Met lie Leu Thr Gly Ala 
1735 1740 1745 

cag tct etc aac aag gtg ctt gga cga gag gtt tac act tec aac 

Gin Ser Leu Asn Lys Val Leu Gly Arg Glu Val Tyr Thr Ser Asn 
1750 1755 1760 

ctt cag. ctt gga gga acc cag att atg gee cga aac ggt acc acg 

Leu Gin Leu Gly Gly Thr Gin lie Met Ala Arg Asn Gly Thr Thr 
1765 1770 " 1775 

cat etc gtc get gaa tct gat etc gat ggt get etc aag gtc ate 

His Leu Val Ala Glu Ser Asp Leu Asp Gly Ala Leu Lys Val lie 
1780 1785 1790 

cag tgg etc teg tat gtg ccc gag cga aag ggc aag gee att cct 

Gin Trp Leu ser Tyr Val Pro Glu Arg Lys Gly Lys Ala lie Pro 
1795 1800 1805 

ate tgg cct tec gag gac cct tgg gac cga act gtg acc tac gag 

lie Trp Pro Ser Glu Asp Pro Trp Asp Arg Thr Val Thr Tyr Glu 
1810 1815 1820 

cct ccc cga ggt cct tac gat cct cga tgg ttg ctt gaa gga aag 

Pro Pro Arg Gly Pro Tyr Asp Pro Arg Trp Leu Leu Glu Gly Lys 



7603 
7648 
7693 
7738 
7783 
7828 
7873 
7918 
7963 
8008 
8053 
8098 
8143 
8188 
8233 
8278 
8323 



1825 



1830 



1835 
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Pro Asp 



gaa ggc ttg 
Glu Gly Leu 
1840 
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act ggt ctt ttc gac aag gga tct 
Thr Gly Leu Phe Asp Lys Gly ser 
1845 
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ttc atg 
Phe Met 
1850 



8368 



gag acc 
Glu Thr 



cga ctg 
Arg Leu 



gat 

Leu Gly Asp 
1855 



gqa ggc att 
Gly Gly lie 



tgg gcc aag act ate gtc acc ggt cga gec 
Trp Ala Lys Thr He Val Thr Gly Arg Ala 
1860 1865 



1870 



cct atg ggt gtt att get gtc gaa 
Pro Met Gly val lie Ala val Glu 
1875 



acc agg 
Thr Arg 
1880 



8413 



8458 



acg acc 
Thr Thr 



ttc gag 
Phe Glu 



get get 
Ala Ala 



ggt ctt 
Gly Leu 



gga cag 
Gly Gin 



ate gtt 
lie val 



gtt gtc 
val val 



gat cct 
Asp Pro 



acc get 
Thr Ala 



aag ttc 
Lys Phe 



gag aag ate 
Glu Lys lie 
1885 

caa aag att 
Gin Lys lie 
1900 

tac aag acc 
Tyr Lys Thr 
1915 

cct ttg atg 
Pro Leu Met 
1930 



ate get gcc gat cct gcc aac cct gca get 
lie Ala Ala Asp Pro Ala Asn Pro Ala Ala 
1890 1895 



atg gag get ggt cag gtt tgg aac 
Met Glu Ala Gly Gin Val Trp Asn 
1905 



ccc aac 
Pro Asn 
1910 



get caa tec ate ttt gat ate aac aag gag 
Ala Gin ser lie Phe Asp lie Asn Lys Glu 
1920 1925 



atg 

Gly Asp Met 
1945 



ggt gat 

Sly 



gac ggt etc 
Asp Gly Leu 
1960 



ccc aac gga 
Pro Asn Gly 
1975 

act ate aac 
Thr lie Asn 
1990 



ate ctt gcc aac ate cga ggt ttc 

lie Leu Ala Asn lie Arg Gly Phe 
. 1935 

ttt gac get ate etc aag cag ggt 

Phe Asp Ala lie Leu Lys Gin Gly 
1950 

teg aac ttc aag cag cca gtg ttc 

Ser Asn Phe Lys Gin Pro val Phe 
1965 

gag ctt cgt gga gga get tgg gtc 

Glu Leu Arg Gly Gly Ala Trp Val 
1980 



tct gga 
Ser Gly 
1940 

tct aag 
ser Lys 
1955 

gtc tat 
val Tyr 
1970 

gtg ttg 
val Leu 
1985 



cga gga 



gga 



Arg Gly Gly 
2005 

cga cga gac 
Arg Arg Asp 
2020 



ctt gcc aag atg gag atg tac get gat gaa 
Leu Ala Lys Met Glu Met Tyr Ala Asp Glu 
1995 2000 

att etc gag ccg gaa ggt ate gtt gag ate 
lie Leu Glu Pro Glu Gly lie val Glu lie 
2010 2015 

aag gtc ate get acc atg gag cga ttg gac 
Lys Val lie Ala Thr Met Glu Arg Leu Asp 
2025 2030 



8503 



8548 



8593 



8638 



8683 



8728 



8773 



8818 



8863 



8908 



gag acc 
Glu Thr 



tct gcg 
Ser Ala 



gag act 
Glu Thr 



get gat 
Ala Asp 



tat gcc tct 
Tyr Ala ser 
2035 

gag gag cga 
Glu Glu Arg 
2050 



etc aaa get gcc teg aac gac tea acc aag 

Leu Lys Ala Ala Ser Asn Asp Ser Thr Lys 
2040 2045 

get aag agt get gag eta etc aag gca aga 

Ala Lys ser Ala Glu Leu Leu Lys Ala Arg 
2055 2060 



eta ctt caa 
Leu Leu Gin 
2065 

etc cat gat 
Leu His Asp 
2080 



ccg acg tac ttg cag att gca cac ctt tac 
Pro Thr Tyr Leu Gin lie Ala His Leu Tyr 
2070 2075 



cgt gtc gga cga atg gag gcc aag 
Arg val Gly Arg Met Glu Ala Lys 
2085 

16 



ggt tgc 
Gly cys 
2090 



8953 



8998 



9043 



9088 
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gcg aag cga get gtc tgg get gag get cga cga ttc ttc tac tgg 
Ala Lys Arg Ala Val Trp Ala Glu Ala Arg Arg Phe Phe Tyr Trp 
2095 2100 2105 



9133 



cga ctt cga cga cgt etc aac gat gag 
Arg Leu Arg Arg Arg Leu Asn Asp Glu 
2110 2115 



gtgagccgtc ccattcactc 



9180 



tttcgttgca aggttcagta gtactaaccg cttctttctt tatctatcag cac ate 9236 

His lie 



ctg tct aag ttc get get gec aac ccg gat ctt act etc 
Leu Ser Lys Phe Ala Ala Ala Asn Pro Asp Leu Thr Leu 
2120 2125 2130 

cga caa aac att etc gac tct gtc gtc cag act gac etc 
Arg Gin Asn lie Leu Asp Ser val val Gin Thr Asp Leu 
2135 2140 2145 

gac cga gec acc get gaa tgg att gag cag tct gca gaa 
Asp Arg Ala Thr Ala Glu Trp lie Glu Gin ser Ala Glu 
2150 2155 2160 

get get gec gtt gee gaa gtc cga tec acc tac gtg teg 
Ala Ala Ala val Ala Glu Val Arg ser Thr Tyr Val Ser 
2165 2170 2175 

att ate age ttc gee gag acg gag cga get gqa gcg ttg 
lie lie Ser Phe Ala Glu Thr Glu Arg Ala Gly Ala Leu 
2180 2185 2190 

ttg gtc get gtc ttg age act ttg aat gcg gaa gac aag 
Leu val Ala val Leu Ser Thr Leu Asn Ala Glu Asp Lys 
2195 2200 2205 



gag gag 
Glu Glu 



act gat 
Thr Asp 



gag att 
Glu lie 



aat aag 
Asn Lys 



cag gqc 
Gin Gly 



aag gec 
Lys Ala 



5281 



9326 



9371 



9416 



9461 



9506 



ctt gtt tct age ctt ggt etc taa 
Leu val Ser ser Leu Gly Leu 
2210 


attttaattt 


tttttgtcga 


tgetattett 


9560 


cctatcttta 


gtctttgatt 


aacttttgaa 


tatccttcat 


agatctttcc 


ttgeatacat 


9620 


tgatattatt 


tcctcacccg 


tttttatgta 


cttccatacg 


agtttccatt 


tttttctget 


9680 


tttatatttc 


gaetacaegt 


cgactgttca 


cctgcctctc 


ttttgttctt 


tctgttctgt 


9740 


tttcttctgt 


tctttcgcct 


cttgggattc 


tatattctcc 


ttegcattta 


catatgetea 


9800 


tgttaatgtc 


tgactcagag 


ttcactagga 


tatgtcgtga 


gagecegaaa 


caagttgeae 


9860 


aacatatatt 


gataatgatc 


agaacactct 


aagaccaccc 


agtccatgat 


cagccgcatc 


9920 


gecagttteg 


atctcttctc 


cattctcatc 


aacctcaatc 


tcctcccgga 


tcgtcctgcc 


9980 


cagcagactg 


ccgaataact 


cgtcgacctg 


ctcctcctgc 


cacaagtctt 


ccgttcgctc 


10040 


aggaaccatg 


aagttcatga 


tcttttcttg 


gggggtatat 


egaagcttge 


gacctttaga 


10100 


agctcgtgta 


tcgagggtgg 


gcttgtgctt 


tttgggtccg 


taattggaaa 


aggttgcttg 


10160 


gectatttea 


aaataaacga 


aattgatgat 


tatacaccgc 


cgtagaccgt 


ttctggtcag 


10220 


gattttgtgt 


tggacgatga 


tataccgatc 


gatgtttgag 


cagaeaaggg 


agttaggaag 


10280 


agactactta 


ccactcatag 


cgccgactcc 


agcacctcca 


cctcttcgct 


egatgaegtc 


10340 


tctgaccaag 


ctctggtaaa 


actctttgtc 


atcaccccaa 


acggcggcct 


cacattcagc 


10400 
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ctcatcctga gagacgagtc ccatgaaccg atctactttt ttcctaccct ctagaccctc 10460 
aagggaagct ccaatttgct cgacgactcc gatcttgacg gatttaaact tttcacctcg 10520 



aagattctga aggccttgag cggtcataat cttggaagac c 



10561 



<210> 2 

<211> 6645 

<212> DNA 

<213> Phaffia rhodozyma 
<220> 

<221> CDS 

<222> (1) . . C6645) 

<223> 



<400> 2 
atg gtt gtc 
Met val val 
1 

ctt gag aac 
Leu Glu Asn 



gat ggt cac 
Asp Gly His 
35 

get get gta 
Ala Ala val 
50 

ttt gqa gat 
Phe Gly Asp 
65 

gat ttg aag 
Asp Leu Lys 



gat cac gag age 
Asp His Glu Ser 

gec cct ccg tea 
Ala Pro Pro Ser 
20 

acg gtc ate acc 
Thr Val lie Thr 



gta agg 
val Arg 



aaa gag 
Lys Glu 



gag cga 
Glu Arg 



gaa gtt cct 
Glu val Pro 



ate gtt gac 
lie Val Asp 
115 

tgg ggt cac 
Trp Gly His 
130 

tea aag aac 
ser Lys Asn 
145 



gtq aac 
Val Asn 
85 

gqa gqa 
Gly Gly 
100 

att gee 
He Ala 



ate cga 
lie Arg 
55 

gee ate 
Ala lie 
70 

tgc gac 
cys Asp 



act aac 
Thr Asn 



gag cga 
Glu Arg 



age gtc 
Ser val 
25 

aaa gtc 
Lys Val 
40 

tea gtt 
Ser val 



gaa ttt 
Glu Phe 



tat att 
Tyr lie 



cat ttc ate 
His Phe lie 
10 

acc gat ttc 
Thr Asp Phe 



get teg gaa aac 
Ala ser Glu Asn 
135 



aac aac 
Asn Asn 
105 

ttc aat 
Phe Asn 
120 

ccc aga 
pro Arg 



etc att gee 
Leu lie Ala 



cgt aaa tgg 
Arg Lys Trp 
60 

acg gta atg 
Thr Val Met 
75 

cga atg get 
Arg Met Ala 
90 

aat cac tct 
Asn His ser 



aag ate gtc ttc 
Lys lie Val Phe lie GTy 
150 



att ggt 



ata cat get 
lie His Ala 



ctt ccc gag 
Leu Pro Glu 
140 

cct ccc gqa 
Pro Pro Gly 
155 
18 



ggt gqa 
Gly Gly 



gtt aga 
val Arg 
30 

aac aac 
Asn Asn 
45 

get tac 
Ala Tyr 



gee act 
Ala Thr 



gat cga 
Asp Arg 



aac gtc 
Asn val 
110 

gtt tgg 
val Trp 
125 

tct etc 
Ser Leu 



aac gca 
Asn Ala 
15 

agt caa 
Ser Gin 



gqa ate 
Gly lie 



gag acg 
Glu Thr 



cca gaa 
Pro Glu 
80 

gtc gtc 
val Val 
95 

gac etc 
Asp Leu 



get gqa 
Ala Gly 



gee gee 
Ala Ala 



tec get atg cga 
Ser Ala Met Arg 
160 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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tec ctt gga gac aag att tct teg acc ate gtt gec cag tct gec cag 528 
Ser Leu Gly Asp Lys lie Ser ser Thr lie Val Ala Gin Ser Ala Gin 
165 170 175 

gtg ccg tgt atg gec tgg tct gqa tea ggc ate act gat aca gag etc 576 
Val Pro Cys Met Ala Trp ser Gly ser Gly lie Thr Asp Thr Glu Leu 
180 185 190 

age cct cag ggc ttc gtq act gtq ccc gat ggg cca tat cag get get 624 
Ser Pro Gin Gly Phe Val Thr Val Pro Asp Gly Pro Tyr Gin Ala Ala 
195 200 205 

tgt gta aag acg gtq gag gat ggt ttg gtg cga gee gag aag ate ggt 672 
cys val Lys Thr Val Glu Asp Gly Leu val Arg Ala Glu Lys lie Gly 
210 215 220 



ttg cca gtt atg ate aag gec tct gag gga gga gga gga aag ggt ate 
Leu Pro Val Met lie Lys Ala ser Glu Gly Gly Gly Gly Lys Gly lie 
225 230 235 240 



get tec gag gtq cca gga tct ccg att ttc ate atg gee ttg get gga 

Ala ser Glu Val Pro Gly ser Pro lie Phe lie Met Ala Leu Ala Gly 
260 265 270 

tct get cga cat ttg gag gtc cag etc ctt get gat cag tac gga aac 

ser Ala Arg His Leu Glu Val Gin Leu Leu Ala Asp Gin Tyr Gly Asn 
275 280 285 



gcg gag tct gtc cag act cag agg aag cct agg ccc aag ggt cac gtc 
Ala Glu Ser Val Gin Thr Gin Arg Lys Pro Arg Pro Lys Gly His Val 
420 425 ' 430 

19 



720 



cga atg gtt cac age atg gac aca ttc aag aac tec tac aac tec gtc 768 
Arg Met Val His ser Met Asp Thr Phe Lys Asn Ser Tyr Asn Ser Val 
245 250 255 



816 



864 



get ate tct ttg ttc ggt cga gat tgc tct gtt cag cga cga cat cag 912 

Ala lie Ser Leu Phe Gly Arg Asp cys Ser Val Gin Arg Arg His Gin 

290 295 300 

aag ate att gag gag get ccc gtc acg ate get cgt cca gag aga ttc 960 

Lys lie He Glu Glu Ala Pro Val Thr lie Ala Arg Pro Glu Arg Phe 

305 310 315 ~ 320 

gaa gag atg gag aag get get gtc agg ttg gee aag tta gta gga tat 1008 

Glu Glu Met Glu Lys Ala Ala val Arg Leu Ala Lys Leu val Gly Tyr 

325 330 335 

gtt agt gee ggt acc gtc gaa tac etc tac tct cac gec gac gac tea 1056 

val ser Ala Gly Thr val Glu Tyr Leu Tyr ser His Ala Asp Asp Ser 

340 345 350 

ttc ttc ttc etc gaa etc aac cct cga ctt caa gtc gag cac cct act 1104 

Phe Phe Phe Leu Glu Leu Asn Pro Arg Leu Gin val Glu His Pro Thr 

355 360 ~ 365 

acc gag atg gtc teg ggt gtc aac ctt ccc get get cag ctt cag att 1152 

Thr Glu Met val ser Gly val Asn Leu Pro Ala Ala Gin Leu Gin lie 

370 375 380 

get atg ggt ate cct ctt tct cga att egg gat att cga gtc etc tac 1200 

Ala Met Gly lie Pro Leu ser Arg lie Arg Asp lie Arg val Leu Tyr 

385 390 ~ 395 400 

ggt etc gat ccc cac act gtt tec gag ate gac ttc gac age age aga 1248 

Gly Leu Asp Pro His Thr val ser Glu lie Asp Phe Asp Ser ser Arg 

405 410 415 



1296 
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att gcc tgt cga ate acg agt gaa aac ccc gat gag gqg ttc aag ccg 
lie Ala cys Arg He Thr ser Glu Asn Pro Asp Glu Gly Phe Lys Pro 
435 440 445 

tct gcc gga gat ate caa gag ttg aac ttc aga agt aat act aac gtc 
Ser Ala Gly Asp lie Gin Glu Leu Asn Phe Arg Ser Asn Thr Asn Val 
450 455 460 

tgg gga tac ttc tct gtt gga get act gga gga att cat agt ttc gcc 
Trp Gly Tyr Phe Ser val Gly Ala Thr Gly Gly lie His Ser Phe Ala 
465 470 475 480 

gat tct caa ttc ggt cac gtg ttt get tat ggc tec gac cga acg act 
Asp Ser Gin Phe Gly His Val Phe Ala Tyr Gly Ser Asp Arg Thr Thr 
485 490 495 

gcc aga aag aat atg gtt ate gcc ttg aaa gag ctt tec att cga gga 
Ala. Arg Lys Asn Met Val lie Ala Leu Lys Glu Leu Ser lie Arg Gly 
500 505 510 

gac ttc cga ace act gtc gag tat ctt ate act ctt ctt gag acg age 
Asp Phe Arg Thr Thr val Glu Tyr Leu lie Thr Leu Leu Glu Thr Ser 



gat ttc gag cag aac gcc att ace ace get tgg ttg gat ggg ttg ate 

Asp Phe Glu Gin Asn Ala lie Thr Thr Ala Trp Leu Asp Gly Leu lie 
530 535 540 

act aac aag ctt aca tct gag agg cct gat cca tea ctg gcc gtt att 

Thr Asn. Lys Leu Thr ser Glu Arg Pro Asp Pro ser Leu Ala val lie 

545 550 " 555 560 

tgt gqt gca att gtq aaa get cac gtg get tct gag aac tgt tgg gcc 

Cys Gly Ala lie Val Lys Ala His val Ala Ser Glu Asn cys Trp Ala 

565 570 575 

gaa tac cga cga gta ttg gac aag gga cag gtt ccc tec aag gac act 

Glu Tyr Arg Arg Val Leu Asp Lys Gly Gin val Pro Ser Lys Asp Thr 

580 585 590 

etc aag aca gtg ttc act ctt gat ttc ate tat gag ggt gtt egg tac 

Leu Lys Thr val Phe Thr Leu Asp Phe lie Tyr Glu Gly val Arg Tyr 

595 600 605 

aat ttc acc get get cga gcc tec etc aac act tac cga ttg tat eta 

Asn Phe Thr Ala Ala Arg Ala ser Leu Asn Thr Tyr Arg Leu Tyr Leu 
610 615 620 

aac gga gga aag acc gtq gtg tec ate cga cct ttg gcc gat ggt gga 

Asn Gly Gly Lys Thr val val ser lie Arg Pro Leu Ala Asp Gly Gly 

625 630 635 640 

atg etc gtt ctt etc gat ggc cga tec cac act etc tac tgg agg gag 

Met Leu val Leu Leu Asp Gly Arg Ser His Thr Leu Tyr Trp Arg Glu 

645 ~ 650 655 

gaa gtc ggt acc etc cga att cag gta gac gca aag act tgc ctg att 

Glu Val Gly Thr Leu Arg lie Gin Val Asp Ala Lys Thr Cys Leu lie 

660 " 665 670 

gag cag gag aac gac ccc act cag etc cga tea ccc teg cct gga aag 

Glu Gin Glu Asn Asp Pro Thr Gin Leu Arg Ser Pro Ser Pro Gly Lys 

675 680 685 

ate ate egg ttt ttg gtc gaa age gga gat cac ate tec tec gga gat 

lie lie Arg Phe Leu val Glu Ser Gly Asp His lie Ser Ser Gly Asp 



1344 
1392 
1440 
1488 
1536 
1584 
1632 
1680 
1728 
1776 
1824 
1872 
1920 
1968 
2016 
2064 
2112 





690 



695 



700 
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ate tat get gag gtt gag gtc atg aag atg ate ttg ccc ttg att gec 
lie Tyr Ala Glu Val Glu Val Met Lys Met He Leu Pro Leu lie Ala 
705 710 715 720 

cag gag tec ggt cac gtt cag ttt gtc aag caa gee ggt gtq ace gtc 
Gin Glu ser Gly His val Gin Phe Val Lys Gin Ala Gly val Thr Val 
725 730 735 

gat cct gga gcg att att ggg ate ttg agt ctt gat gac cct acg cga 
Asp Pro Gly Ala lie lie Gly lie Leu ser Leu Asp Asp Pro Thr Arg 
740 745 750 

gtq aag aag gcg aag ccc ttc gag ggt etc ctg cct gtq act gqt etc 
Val Lys Lys Ala Lys Pro Phe Glu Gly Leu Leu Pro Val Thr Gly Leu 
755 760 " 765 

cct aac ctg ccc ggt aac aga cct cac cag egg eta cag ttc cag ctt 
Pro Asn Leu Pro Gly Asn Arg Pro His Gin Arg Leu Gin Phe Gin Leu 
770 775 780 

gag teg ata tac teg gtc ttg gat gga tac gag agt gac tec act gca 
Glu ser lie Tyr Ser val Leu Asp Gly Tyr Glu Ser Asp Ser Thr Ala 
785 790 795 800 

aca ate etc cga tea ttc tct gaa aac ctt tat gat cct gat ctt get 
Thr lie Leu Arg ser Phe ser Glu Asn Leu Tyr Asp Pro Asp Leu Ala 
805 810 815 

ttc gga gag get tta tec ate att tec gtc ctt tct ggg aga atg cct 
Phe Gly Glu Ala Leu ser lie lie Ser val Leu ser Gly Arg Met Pro 
820 825 830 

gee gat ctt gag gag age att cga gag gtc ate age gaa get cag teg 
Ala Asp Leu Glu Glu Ser lie Arg Glu Val lie Ser Glu Ala Gin ser 
835 840 845 

aag cct cac gee gag ttc cct gga tea aag ate etc aaa gtc gtc gag 
Lys Pro His Ala Glu Phe Pro Gly ser Lys lie Leu Lys Val val Glu 
850 855 860 

egg tac ate gat aat ttg cga cct cag gag agg get atg gtc cga act 
Arg Tyr lie Asp Asn Leu Arg Pro Gin Glu Arg Ala Met val Arg Thr 
865 870 875 880 

cag ate gaa ccc ate gtt ggt att get gag aag aac gtt ggc ggt cct 
Gin lie Glu Pro lie Val Gly lie Ala Glu Lys Asn val Gly Gly Pro 
885 * 890 895 

aag ggt tac gee tct tac gtc tta get ace ate ctt caa aag ttc ttg 
Lys Gly Tyr Ala ser Tyr val Leu Ala Thr lie Leu Gin Lys Phe Leu 
900 905 910 

gee gtt gag gee gtt ttt get act ggt agt gaa gag gee att gtt etc 
Ala val Glu Ala val phe Ala Thr Gly ser Glu Glu Ala lie val Leu 
915 920 925 

caa ctt cga gat gaa aac cga gaa tct ttg aac gac gtc ctt ggt etc 
Gin Leu Arg Asp Glu Asn Arg Glu ser Leu Asn Asp val Leu Gly Leu 
930 . 935 940 

gtc ctg get cac teg cgt etc age get cga tec aag ctt gtt etc tec 
val Leu Ala His ser Arg Leu ser Ala Arg Ser Lys Leu Val Leu Ser 
945 950 955 960 

gtc ttt gat ctg ate aag tct atg cag etc etc aac aac act gag ggt 
val Phe Asp Leu lie Lys Ser Met Gin Leu Leu Asn Asn Thr Glu Gly 



2160 
2208 
2256 
2304 
2352 
2400 
2448 
2496 
2544 
2592 
2640 
2688 
2736 
2784 
2832 
2880 
2928 



965 



970 



975 
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tct ttc ctt cat aag act atg aaa gcg ctt gcc gac atg ccc acc aag 
Ser Phe Leu His Lys Thr Met Lys Ala Leu Ala Asp Met Pro Thr Lys 
980 985 990 

get cct ttg gcc age aag gtg tct ttg aag get egg gaa att ctt ate 
Ala Pro Leu Ala ser Lys vat ser Leu Lys Ala Arg Glu lie Leu lie 
995 1000 1005 

tct tgc tct ctt ccc tct tac gag gag agg ttg ttc cag atg gaa 
Ser cys ser Leu Pro ser Tyr Glu Glu Arg Leu Phe Gin Met Glu 
1010 1015 ~ 1020 

aag ate ctt aac tct tct gtc acc act tct tac tac gga gag act 
Lys lie Leu Asn Ser Ser Val Thr Thr Ser Tyr Tyr Gly Glu Thr 
1025 1030 1035 

gga gqt gga cac aga aac cct teg gtt gat gtt ctg act gag ate 
Gly Gly Gly His Arg Asn Pro Ser val Asp val Leu Thr Glu lie 
1040 ~ 1045 1050 

tea aac tct cga ttc acc gtc tac gat gtc ctg tec tec ttc ttc 
ser Asn ser Arg Phe Thr val Tyr Asp val Leu Ser Ser Phe Phe 
1055 1060 1065 

aag cac gat gat cct tgg att gtt ctt get agt ttg acc gtc tac 
Lys His Asp Asp Pro Trp lie val Leu Ala Ser Leu Thr val Tyr 
1070 1075 1080 

gtt ctt cga get tac cga gag tac agt att ctt gat atg caa cat 
Val Leu Arg Ala Tyr Arg Glu Tyr ser lie Leu Asp Met Gin His 
1085 ~ 1090 1095 

gag caa gqt cag gat gqc get get gga gtc ate act tgg cga ttc 
Glu Gin Gly Gin Asp Gly Ala Ala Gly Val lie Thr Trp Arg Phe 
1100 1105 1110 

aag etc aac cag ccc ate get gag tct tct act ccc cga gtt gac 
Lys Leu Asn Gin Pro lie Ala Glu Ser Ser Thr Pro Arg Val Asp 
1115 1120 1125 

teg aat cga gac gtt tac cga gtc gqt teg ctt tct gat ttg acc 
ser Asn Arg Asp val Tyr Arg Val Gly ser Leu ser Asp Leu Thr 
1130 1135 1140 

tac aag ate aag cag agt cag acc gag ccc etc cga get gqt gtc 
Tyr Lys lie Lys Gin ser Gin Thr Glu Pro Leu Arg Ala Gly val 
1145 1150 1155 

atg acg age ttc aac aac ttg aag gag gtt cag gac gga etc ttg 
Met Thr Ser Phe Asn Asn Leu Lys Glu val Gin Asp Gly Leu Leu 
1160 1165 1170 

aat gtt ctg tct ttc ttc cct get tac cat cat caa gat ttc act 
Asn Val Leu ser Phe Phe Pro Ala Tyr His His Gin Asp Phe Thr 
1175 1180 1185 

caa cga cat gqt cag gac agt gcc atg ccc aac gtt etc aac att 
Gin Arg His Gly Gin Asp ser Ala Met Pro Asn Val Leu Asn lie 
1190 1195 1200 

get ate egg get ttc gag gag aag gac gac atg tct gat ctt gat 
Ala lie Arg Ala Phe Glu Glu Lys Asp Asp Met Ser Asp Leu Asp 
1205 1210 1215 

tgg gcc aag agt gtt gag teg ctg gta atg cag atg tct gcc gag 
Trp Ala Lys Ser val Glu Ser Leu Val Met Gin Met Ser Ala Glu 



3069 

3114 

3159 

3204 

3249 

3294 

3339 

3384 

3429 

3474 

3519 

3564 

3609 

3654 

3699 



1220 



1225 



1230 
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ate cag 
He Gin 
1235 

aag ggc 
Lys Gly 
1250 

cag gqc 
Gin Gly 
1265 

get eta 
Ala Leu 
1280 

gtc acc 
val Thr 
1295 

gga gtq 
Gly Va I 
1310 

get ttg 
Ala Leu 
1325 

gag tat 
Glu Tyr 
1340 

gac gec 
Asp Ala 
1355 



aag aag gga 



att cga cga gtt acc ttc ttg gtt 
Lys Lys Gly lie Arg Arg val Thr Phe Leu val 
1240 1245 



tgc cga 
cys Arg 



gtt tac ccc 
val Tyr Pro 



ccc tgg aga 
pro Trp Arg 



gec agt cag 
Ala Ser Gin 



cct ate ttc 
Pro lie Phe 



tec tac ttc acc ttc aga caa gag 
Ser Tyr Phe Thr Phe Arg Gin Glu Gly Ala 
1255 1260 



ggt gec 

3l\ 



gag gag gag aag att cga aac ate 
Glu Glu Glu Lys lie Arg Asn lie 
1270 1275 

ctt gag etc aac cga etc teg aat 
Leu Glu Leu Asn Arg Leu ser Asn 
1285 1290 

gta gac aac aga cag ate cac ate 
val Asp Asn Arg Gin lie His lie 
1300 1305 



ggt aag gag aac tct tec gat gtt cga ttc ttt 
Gly Lys Glu Asn Ser Ser Asp val Arg Phe Phe 
1315 1320 



gtt cga cct 
Val Arg Pro 



etc ate tec 
Leu lie Ser 



ttg gag gtt 
Leu Glu val 



gga egg gtc cag gga teg atg aag 
Gly Arg Val Gin Gly ser Met Lys 



1330 



1335 



gag tgc gat cga ctg etc act gat 
Glu Cys Asp Arg Leu Leu Thr Asp 
1345 1350 

gtt gga gee gag act cga aac gee 
Val Gly Ala Glu Thr Arg Asn Ala 
1360 1365 



gag cct 
Glu Pro 



ttc aag 
Phe Lys 



tac aag 
Tyr Lys 



ate egg 
lie Arg 



get gec 
Ala Ala 



ate ctg 
lie Leu 



gat tgc 
Asp cys 



3744 



3789 



3834 



3879 



3924 



3969 



4014 



4059 



4104 



aac cat 
Asn His 
1370 

gac gac 
Asp Asp 
1385 

aag agg 
Lys Arg 
1400 



gtt gga att aac ttc ate tat aac gtt ctt gtc 
val Gly He Asn Phe lie Tyr Asn val Leu val 
1375 1380 



gac ttc 
Asp Phe 



gtc cag gag gee ctt gee ggg ttc att gag agg 
Val Gin Glu Ala Leu Ala Gly Phe lie Glu Arg His Gly 
1390 1395 



cac gc)a 



ctt tgg cga 
Leu Trp Arg 



ctt cga gtg acc get tct gaa ate 
Leu Arg val Thr Ala Ser Glu lie 
1405 1410 



cga atg 
Arg Met 



4149 



4194 



4239 



gtt ctt 
val Leu 
1415 

att gag 
He Glu 
1430 

gag gtt 
Glu val 
1445 

gac ctt 
Asp Leu 
1460 

acc aag 
Thr Lys 
1475 



gag gac gac gag ggt aac gtc acc ccc ate cga 
Glu Asp Asp Glu Gly Asn val Thr Pro lie Arg 



1420 



1425 



aac gtt tct 
Asn Val Ser 



gag acc gag 
Glu Thr Glu 



ggt ttc gtc gtq aag tac cac gee 
Gly Phe Val val Lys Tyr His Ala 



1435 



1440 



tgc tgc 
Cys Cys 



tac cag 
Tyr Gin 



sag ggt act acc ate ttg aag tea ate gga 
Lys Gly Thr Thr lie Leu Lys ser lie Gly 
1450 1455 



gga cct ctt 
il\ 



cac ctt cag cct gtc aac cat get 
Gly Pro Leu His Leu Gin pro Val Asn His Ala 
1465 1470 



aac agt ctt 
Asn Ser Leu 



cag ccc cga cga tac cag get cac 

Gin Pro Arg Arg Tyr Gin Ala His 

1480 ' " ' 1485 

23 



tac cag 
Tyr Gin 



ttg gtt 
Leu val 



4284 



4329 



4374 



4419 



4464 
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1490 



tac gtc 
Tyr val 
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tac gac tac ccc gat etc ttc gtt 
Tyr Asp Tyr Pro Asp Leu Phe Val 
1495 1500 
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cag agt 
Gin ser 



4509 



ttg cgc aag 
Leu Arg Lys 
1505 



gtt tgg get gag get get get aag att cct cac etc 
val Trp Ala Glu Ala Ala Ala Lys lie Pro His Leu 
1510 1515 



4554 



egg gtg cct 
Arg val Pro 
1520 



age gag cct ctt acc get ace gag ttg gtt etc gat 
ser Glu Pro Leu Thr Ala Thr Glu Leu Val Leu Asp 
1525 1530 



4599 



gag aac aac 
Glu Asn Asn 
1535 



gag ctt cag gag gtc gag cga cct ccg ggt 
Glu Leu Gin Glu Val Glu Arg Pro Pro Gly 
1540 1545 



tec aac 
Ser Asn 



4644 



teg tgt 
Ser cys Gly 
1550 



ggt 
31V 



atg gtc gee tgg ate ttc act atg etc act ccc gag 
Met val Ala Trp lie Phe Thr Met Leu Thr Pro Glu 
1555 1560 



4689 



tat ccc aag 
Tyr Pro Lys 
1565 



ggt cga cga gta gtt gee att gee aac gat ate acc 
Gly Arg Arg Val val Ala lie Ala Asn Asp lie Thr 
1570 1575 



4734 



ttc aag att 
Phe Lys lie 
1580 



gga tec 
Gly Ser 



ttt ggt cct aag gaa gac gat tac ttc ttc 
Phe Gly Pro Lys Glu Asp Asp Tyr Phe Phe 
1S85 1590 



4779 



aag get act 
Lys Ala Thr 
1595 



gaa att gec aag aag ctg ggc ctt cct cga 
Glu lie Ala Lys Lys Leu Gly Leu Pro Arg 
1600 1605 



att tac 
lie Tyr 



4824 



etc tct gee 
Leu ser Ala 
1610 



aac agt gga get aga etc ggt ate gcg gag gag etc 4869 
Asn ser Gly Ala Arg Leu Gly lie Ala Glu Glu Leu 



1615 



1620 



ttg cac ate 
Leu His lie 
1625 



ttc aag gcg gee ttc gtt gac ccc gca aag cct tec 
Phe Lys Ala Ala Phe val Asp Pro Ala Lys Pro Ser 
1630 1635 



4914 



atg ggt att 
Met Gly lie 
1640 



aag tat eta tac ttg acc cct gaa act tta tec act 
Lys Tyr Leu Tyr Leu Thr Pro Glu Thr Leu ser Thr 
1645 1650 



4959 



ctt gee aag 
Leu Ala Lys 
1655 



tec age gtc acc act gag gag ate gag gat 
Lys Gly Ser Ser val Thr Thr Glu Glu lie Glu Asp 
1660 1665 



aag gga 
it 



5004 



gac ggc gag 
Asp Gly Glu 
1670 



cga cga cac aag ate acc gec ate ate ggt ctt gca 
Arg Arg His Lys lie Thr Ala lie lie Gly Leu Ala 
1675 1680 



5049 



gag ggt ttg 
Glu Gly Leu 
1685 



gga gtt 



gag tct ctt cga gga tec ggt ctt att get 5094 
~" ^Ty ser Gly Leu lie Ala 



Gly Val Glu ser Leu Arg GTy ser GTy Leu 
1690 1695 



gga gec 
Gly Ala 
1700 



acc act cga get tac gag gag gga ate ttc acc ate tct 5139 
Thr Thr Arg Ala Tyr_ Glu Glu Gly lie Phe Thr lie ser 



1705 



1710 



etc gtt act 
Leu Val Thr 
1715 



gee cga 
Ala Arg 



teg gtc ggt ate gga get tac ttg 
Ser val Gly lie Gly Ala Tyr Leu 
1720 1725 



gtt cga 
val Arg 



5184 



ttg ggt cag 
Leu Gly Gin 
1730 



cga get att cag gtt gaa ggc aac cct atg ate ctt 5229 
Arg Ala lie Gin val Glu Gly Asn Pro Met lie Leu 



1735 



1740 
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act gga get cag tct etc aac aag gtg ctt gga cga gag gtt tac 
Thr Gly Ala Gin Ser Leu Asn Lys val Leu Gly Arg Glu val Tyr 
1745 1750 1755 

act tec aac ctt cag ctt gga gga acc cag att atg gec cga aac 
Thr Ser Asn Leu Gin Leu Gly Gly Thr Gin lie Met Ala Arg Asn 
1760 1765 1770 

ggt acc acg cat etc gtc get gaa tct gat etc gat ggt get etc 
Gly Thr Thr His Leu val Ala Glu Ser Asp Leu Asp Gly Ala Leu 
1775 1780 1785 

aag gtc ate cag tgg etc teg tat gtq ccc gag cga aag gqc aag 
Lys val lie Gin Trp Leu ser Tyr val Pro Glu Arg Lys Gly Lys 
1790 1795 1800 

gec att cct ate tgg cct tec gag gac cct tgg gac cga act gtq 
Ala lie Pro lie Trp Pro ser Glu Asp Pro Trp Asp Arg Thr vat 
1805 1810 1815 

acc tac gag cct ccc cga gqt cct tac gat cct cga tgg ttg ctt 
Thr Tyr Glu Pro Pro Arg Gly Pro Tyr Asp Pro Arg Trp Leu Leu 
1820 1825 1830 

gaa gga aag ccg gat gaa gqc ttg act gqt ctt ttc gac aag gga 
Glu Gly Lys Pro Asp Glu Gly Leu Thr Gly Leu Phe Asp Lys Gly 
1835 1840 1845 

tct ttc atg gag acc ctt gga gat tgg gee aag act ate gtc acc 
Ser Phe Met Glu Thr Leu Gly Asp Trp Ala Lys Thr He Val Thr 
1850 1855 1860 

ggt cga gee cga ctg gga ggc att cct atg ggt gtt att get gtc 
Gly Arg Ala Arg Leu Gly Gly lie Pro Met Gly Val lie Ala Val 
1865 1870 1875 

gaa acc agg acg acc gag aag ate ate get gec gat cct gee aac 
Glu Thr Arg Thr Thr Glu Lys lie lie Ala Ala Asp Pro Ala Asn 
1880 ~ 1885 1890 

cct gca get ttc gag caa aag att atg gag get ggt cag gtt tgg 
Pro Ala Ala Phe Glu Gin Lys He Met Glu Ala Gly Gin Val Trp 
1895 1900 1905 

aac ccc aac get get tac aag acc get caa tec ate ttt gat ate 
Asn Pro Asn Ala Ala Tyr Lys Thr Ala Gin Ser lie Phe Asp lie 
1910 1915 1920 

aac aag gag gqt ctt cct ttg atg ate ctt gee aac ate cga ggt 
Asn Lys Glu Gly Leu Pro Leu Met lie Leu Ala Asn lie Arg Gly 
1925 1930 1935 

ttc tct gga gga cag ggt gat atg ttt gac get ate etc aag cag 
Phe ser Gly Gly Gin Gly Asp Met Phe Asp Ala lie Leu Lys Gin 
1940 1945 1950 

ggt tct aag ate gtt gac ggt etc teg aac ttc aag cag cca gtg 
Gly Ser Lys lie Val Asp Gly Leu ser Asn Phe Lys Gin Pro val 
1955 1960 1965 

ttc gtc tat gtt gtc ccc aac gga gag ctt cgt gga gga get tgg 
Phe val Tyr Val val Pro Asn Gly Glu Leu Arg Gly Gly Ala Trp 
1970 1975 1980. 

gtc gtg ttg gat cct act ate aac ctt gee aag atg gag atg tac 
Val val Leu Asp Pro Thr lie Asn Leu Ala Lys Met Glu Met Tyr 



5274 
5319 
5364 
5409 
5454 
5499 
5544 
5589 
5634 
5679 
5724 
5769 
5814 
5859 
5904 
5949 
5994 



1985 



1990 



1995 
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get gat gaa acc get cga 
Ala Asp Glu Thr Ala Arg 

2000 ~ 2005 

gtt gag ate aag ttc cga 
Val Glu lie Lys Phe Arg 
2015 

cga ttg gac gag acc tat 
Arg Leu Asp Glu Thr Tyr 
2030 

tea acc aag tct gcg gag 
ser Thr Lys Ser Ala Glu 
2045 

aag gca aga gag act eta 
Lys Ala Arg Glu Thr Leu 
2060 

cac ctt tac get gat etc 
His Leu Tyr Ala Asp Leu 
2075 

aag ggt tgc gcg aag cga 
Lys Gly Cys Ala Lys Arg 
2090 

ttc tac tgg cga ctt cga 
Phe Tyr Trp Arg Leu Arg 
2105 

tct aag ttc get get gec 
ser Lys Phe Ala Ala Ala 
2120 

caa aac att etc gac tct 
Gin Asn lie Leu Asp ser 
2135 

cga gee acc. get gaa tgg 
Arg Ala Thr Ala Glu Trp 
2150 

get gee gtt gec gaa gtc 
Ala Ala val Ala Glu Val 
2165 

ate age ttc gec gag acg 
lie Ser Phe Ala Glu Thr 
2180 

gtc get gtc ttg age act 
val Ala val Leu ser Thr 
2195 

gtt tct age ctt ggt etc taa 
val ser ser Leu Gly Leu 
2210 
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gqa gga att etc gag ccg gaa 
Gly Gly lie Leu Glu Pro Glu 



2010 



ggt ate 
Gly lie 



cga gac aag gtc ate get acc atg gag 
Arg Asp Lys val lie Ala Thr Met Glu 
2020 2025 

gec tct etc aaa get gec teg aac gac 
Ala Ser Leu Lys Ala Ala Ser Asn Asp 
2035 2040 

gag cga get aag agt get gag eta etc 
Glu Arg Ala Lys ser Ala Glu Leu Leu 
2050 2055 

ctt caa ccg acg tac ttg cag att gca 
Leu Gin Pro Thr Tyr Leu Gin lie Ala 
2065 2070 

cat gat cgt gtc gqa cga atg gag gee 
His Asp Arg val Gly Arg Met Glu Ala 
2080 " 2085 

get gtc tgg get gag get cga cga ttc 
Ala Val Trp Ala Glu Ala Arg Arg Phe 
2095 2100 

cga cgt etc aac gat gag cac ate ctg 
Arg Arg Leu Asn Asp Glu His lie Leu 
2110 2115 

aac ccg gat ctt act etc gag gag cga 
Asn Pro Asp Leu Thr Leu Glu Glu Arg 
2125 2130 

gtc gtc cag act gac etc act gat gac 
val val Gin Thr Asp Leu Thr Asp Asp 
2140 2145 

att gag cag tct gca gaa gag att get 
lie Glu Gin ser Ala Glu Glu lie Ala 
2155 2160 

cga tec acc tac gtg teg aat aag att 
Arg Ser Thr Tyr val Ser Asn Lys lie 
2170 2175 

gag cga get gqa gcg ttg cag ggc ttg 
Glu Arg Ala Gly Ala Leu Gin Gly Leu 
2185 " 2190 

ttg aat gcg gaa gac aag aag gec ctt 
Leu Asn Ala Glu Asp Lys Lys Ala Leu 
2200 2205 



6039 



6084 



6129 



6174 



6219 



6264 



6309 



6354 



6399 



6444 



6489 



6534 



6579 



■ 6624 



6645 



<210> 3 
<211> 2214 
<212> PRT 
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<213> Phaffia rhodozyma 



<400> 3 

Met val val Asp His Glu ser val Arg His Phe lie Gly Gly Asn Ala 
1 5 10 15 

Leu Glu Asn Ala Pro Pro ser ser val Thr Asp Phe val Arg ser Gin 
20 25 30 

Asp Gly His Thr Val He Thr Lys val Leu lie Ala Asn Asn Gly lie 
35 40 45 

Ala Ala Val Lys Glu lie Arg Ser Val Arg Lys Trp Ala Tyr Glu Thr 
50 * 55 60 

Phe Gly Asp Glu Arg Ala lie Glu Phe Thr val Met Ala Thr Pro Glu 
65 70 75 80 

Asp Leu Lys val Asn Cys Asp Tyr lie Arg Met Ala Asp Arg val val 
85 90 95 

Glu val Pro Gly Gly Thr Asn Asn Asn Asn His ser Asn val Asp Leu 
100 105 110 

lie Val Asp lie Ala Glu Arg Phe Asn lie His Ala val Trp Ala Gly 
115 120 125 

Trp Gly His Ala Ser Glu Asn Pro Arg Leu Pro Glu Ser Leu Ala Ala 
130 135 140 

Ser Lys Asn Lys lie val Phe lie Gly Pro Pro Gly Ser Ala Met Arg 
145 150 155 160 

ser Leu Gly Asp Lys lie Ser ser Thr lie val Ala Gin Ser Ala Gin 
165 170 175 

val Pro cys Met Ala Trp ser Gly ser Gly lie Thr Asp Thr Glu Leu 
180 185 190 

Ser Pro Gin Gly Phe val Thr val Pro Asp Gly Pro Tyr Gin Ala Ala 
195 200 205 

Cys Val Lys Thr val Glu Asp Gly Leu Val Arg Ala Glu Lys lie Gly 
210 215 220 

Leu Pro Val Met lie Lys Ala Ser Glu Gly Gly Gly Gly Lys Gly lie 
225 230 235 240 

Arg Met Val His Ser Met Asp Thr Phe Lys Asn ser Tyr Asn Ser Val 
245 250 255 
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Ala Ser Glu val Pro Gly Ser Pro lie Phe lie Met Ala Leu Ala Gly 
260 265 270 



ser Ala Arg His Leu Glu Val Gin Leu Leu Ala Asp Gin Tyr Gly Asn 
275 280 285 



Ala lie Ser Leu Phe Gly Arg Asp Cys ser val Gin Arg Arg His Gin 
290 295 300 



Lys lie lie Glu Glu Ala Pro Val Thr lie Ala Arg Pro Glu Arg Phe 
305 310 315 ~ 320 



Glu Glu Met Glu Lys Ala Ala Val Arg Leu Ala Lys Leu Val Gly Tyr 
325 330 335 



Val ser Ala Gly Thr val Glu Tyr Leu Tyr ser His Ala Asp Asp ser 
340 345 350 



Phe Phe Phe Leu Glu Leu Asn Pro Arg Leu Gin val Glu His Pro Thr 
355 360 ~ 365 



Thr Glu Met val Ser Gly val Asn Leu Pro Ala Ala Gin Leu Gin lie 
370 375 380 



Ala Met Gly lie Pro Leu Ser Arg lie Arg Asp lie Arg val Leu Tyr 
385 390 395 400 



Gly Leu Asp Pro His Thr val ser Glu lie Asp Phe Asp Ser ser Arg 
405 410 415 



Ala Glu ser val Gin Thr Gin Arg Lys Pro Arg Pro Lys Gly His Val 
420 425 430 



lie Ala Cys Arg lie Thr ser Glu Asn Pro Asp Glu Gly Phe Lys Pro 
435 440 445 



Ser Ala Gly Asp lie Gin Glu Leu Asn Phe Arg ser Asn Thr Asn val 
450 455 460 



Trp Gly Tyr Phe Ser Val Gly Ala Thr Gly Gly He His ser Phe Ala 
465 470 475 480 



Asp ser Gin Phe Gly His val Phe Ala Tyr Gly ser Asp Arg Thr Thr 
485 490 495 



Ala Arg Lys Asn Met val lie Ala Leu Lys Glu Leu Ser lie Arg Gly 
500 505 510 



Asp Phe Arg Thr Thr Val Glu Tyr Leu lie Thr Leu Leu Glu Thr Ser 
515 520 525 
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Asp Phe Glu Gin Asn Ala lie Thr Thr Ala Trp Leu Asp Gly Leu lie 
530 535 540 



Thr Asn Lys Leu Thr Ser Glu Arg Pro Asp Pro Ser Leu Ala Val lie 
545 550 ~ 555 560 



cys Gly Ala lie val Lys Ala His val Ala ser Glu Asn cys Trp Ala 
565 570 575 



Glu Tyr Arg Arg val Leu Asp Lys Gly Gin val Pro ser Lys Asp Thr 
580 585 590 



Leu Lys Thr val Phe Thr Leu Asp Phe lie Tyr Glu Gly val Arg Tyr 
595 600 605 



Asn Phe Thr Ala Ala Arg Ala ser Leu Asn Thr Tyr Arg Leu Tyr Leu 
610 615 620 



Asn Gly Gly Lys Thr val val Ser lie Arg Pro Leu Ala Asp Gly Gly 
625 630 635 640 



Met Leu val Leu Leu Asp Gly Arg Ser His Thr Leu Tyr Trp Arg Glu 
645 650 655 



Glu val Gly Thr Leu Arg lie Gin val Asp Ala Lys Thr Cys Leu lie 
660 665 670 



Glu Gin Glu Asn Asp Pro Thr Gin Leu Arg Ser Pro Ser Pro Gly Lys 
675 680 685 



lie lie Arg Phe Leu val Glu Ser Gly Asp His lie Ser ser Gly Asp 
690 ~ 695 700 



lie Tyr Ala Glu Val Glu Val Met Lys Met lie Leu Pro Leu lie Ala 
705 710 715 720 



Gin Glu Ser Gly His val Gin Phe Val Lys Gin Ala Gly val Thr val 
725 730 735 



Asp Pro Gly Ala lie lie Gly lie Leu ser Leu Asp Asp Pro Thr Arg 
740 745 750 



Val Lys Lys Ala Lys Pro Phe Glu Gly Leu Leu Pro Val Thr Gly Leu 
755 760 765 



Pro Asn Leu Pro Gly Asn Arg Pro His Gin Arg Leu Gin Phe Gin Leu 
770 775 780 



Glu ser lie Tyr Ser Val Leu Asp Gly Tyr Glu Ser Asp Ser Thr Ala 
785 790 795 800 
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Thr He Leu Arg Ser Phe ser Glu Asn Leu Tyr Asp Pro Asp Leu Ala 
805 810 815 



Phe Gly Glu Ala Leu Ser lie lie ser Val Leu Ser Gly Arg Met Pro 
820 825 830 



Ala Asp Leu Glu Glu Ser lie Arg Glu Val lie ser Glu Ala Gin Ser 
835 840 845 



Pro His Ala Glu Phe Pro Gly Ser Lys lie Leu Lys Val Val Glu 
850 855 860 



Arg Tyr lie Asp Asn Leu Arg Pro Gin Glu Arg Ala Met val Arg Thr 
865 870 ~ 875 880 



Gin lie Glu Pro lie Val Gly lie Ala Glu Lys Asn val Gly Gly Pro 
885 890 895 



Lys Gly Tyr Ala ser Tyr Val Leu Ala Thr lie Leu Gin Lys Phe Leu 
900 905 910 



Ala Val Glu Ala val Phe Ala Thr Gly Ser Glu Glu Ala lie val Leu 
915 920 925 



Gin Leu Arg Asp Glu Asn Arg Glu ser Leu Asn Asp val Leu Gly Leu 
930 ~ 935 940 



Val Leu Ala His Ser Arg Leu Ser Ala Arg ser Lys Leu val Leu ser 
945 950 955 960 



val Phe Asp Leu lie Lys Ser Met Gin Leu Leu Asn Asn Thr Glu Gly 
965 970 975 



Ser Phe Leu His Lys Thr Met Lys Ala Leu Ala Asp Met Pro Thr Lys 
980 985 990 



Ala Pro Leu Ala Ser Lys Val Ser Leu Lys Ala Arg Glu lie Leu lie 
995 1000 1005 



Ser Cys Ser Leu Pro Ser Tyr Glu Glu Arg Leu Phe Gin Met Glu 
1010 1015 " 1020 



Lys lie Leu Asn ser Ser val Thr Thr Ser Tyr Tyr Gly Glu Thr 
1025 1030 1035 



Gly Gly Gly His Arg Asn Pro ser val Asp val Leu Thr Glu lie 
1040 1045 1050 



ser Asn ser Arg Phe Thr val Tyr Asp val Leu ser ser Phe Phe 



1055 



1060 



1065 
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Lys His Asp Asp pro Trp lie val Leu Ala ser Leu Thr val Tyr 
1070 1075 1080 



val Leu Arg Ala Tyr Arg Glu Tyr ser He Leu Asp Met Gin His 
1085 " 1090 1095 



Glu Gin Gly Gin Asp Gly Ala Ala Gly Val lie Thr Trp Arg Phe 
1100 1105 1110 



Lys Leu Asn Gin Pro lie Ala Glu Ser Ser Thr Pro Arg Val Asp 
1115 1120 1125 



Ser Asn Arg Asp Val Tyr Arg Val Gly Ser Leu Ser Asp Leu Thr 
1130 ~ 1135 1140 



Tyr Lys lie Lys Gin ser Gin Thr Glu Pro Leu Arg Ala Gly Val 
1145 1150 1155 



Met Thr Ser Phe Asn Asn Leu Lys Glu Val Gin Asp Gly Leu Leu 
1160 1165 1170 



Asn val Leu ser Phe Phe Pro Ala Tyr His His Gin Asp Phe Thr 
1175 1180 1185 



Gin Arg His Gly Gin Asp Ser Ala Met Pro Asn Val Leu Asn lie 
1190 1195 1200 



Ala lie Arg Ala Phe Glu Glu Lys Asp Asp Met ser Asp Leu Asp 
1205 ~ 1210 1215 



Trp Ala Lys Ser Val Glu Ser Leu Val Met Gin Met Ser Ala Glu 
1220 1225 1230 



lie Gin Lys Lys Gly lie Arg Arg Val Thr Phe Leu val Cys Arg 
1235 1240 1245 



Lys Gly val Tyr Pro ser Tyr Phe Thr Phe Arg Gin Glu Gly Ala 
1250 1255 1260 



Gin Gly Pro Trp Arg Glu Glu Glu Lys lie Arg Asn lie Glu Pro 
1265 " 1270 1275 



Ala Leu Ala Ser Gin Leu Glu Leu Asn Arg Leu ser Asn Phe Lys 
1280 1285 1290 



val Thr Pro lie Phe val Asp Asn Arg Gin lie His lie Tyr Lys 
1295 1300 " 1305 



Gly Val Gly Lys Glu Asn Ser Ser Asp val Arg Phe Phe lie Arg 



1310 



1315 



1320 
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Ala Leu val Arg Pro Gly Arg val Gin Gly Ser Met Lys Ala Ala 
1325 1330 1335 



Glu Tyr Leu He ser Glu cys Asp Arg Leu Leu Thr Asp lie Leu 
1340 1345 1350 



Asp Ala Leu Glu Val Val Gly Ala Glu Thr Arg Asn Ala Asp cys 
1355 1360 1365 



Asn His val Gly lie Asn Phe lie Tyr Asn Val Leu val Asp Phe 
1370 1375 1380 



Asp Val Gin Glu Ala Leu Ala Gly Phe lie Glu Arg His Gly 
1385 1390 1395 



Lys Arg Leu Trp Arg Leu Arg Val Thr Ala Ser Glu lie Arg Met 
1400 1405 1410 



val Leu Glu Asp Asp Glu Gly Asn val Thr Pro lie Arg Cys cys 
1415 1420 1425 



lie Glu Asn val Ser Gly Phe Val Val Lys Tyr His Ala Tyr Gin 
1430 1435 1440 



Glu val Glu Thr Glu Lys Gly Thr Thr lie Leu Lys Ser lie Gly 
1445 1450 1455 



Asp Leu Gly Pro Leu His Leu Gin Pro val Asn His Ala Tyr Gin 
1460 1465 1470 



Thr Lys Asn ser Leu Gin Pro Arg Arg Tyr Gin Ala His Leu Val 
1475 1480 ~ 1485 



Gly Thr Thr Tyr Val Tyr Asp Tyr Pro Asp Leu Phe val Gin ser 
1490 * 1495 1500 



Leu Arg Lys val Trp Ala Glu Ala Ala Ala Lys lie Pro His Leu 
1505 1510 1515 



Arg val Pro Ser Glu Pro Leu Thr Ala Thr Glu Leu Val Leu Asp 
1520 1525 1530 



Glu Asn Asn Glu Leu Gin Glu Val Glu Arg Pro Pro Gly Ser Asn 
1535 1540 1545 



Ser cys Gly Met val Ala Trp lie Phe Thr Met Leu Thr Pro Glu 
1550 1555 1560 



Tyr Pro Lys Gly Arg Arg Val Val Ala lie Ala Asn Asp lie Thr 



1565 



1570 



1575 
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Phe Lys lie Gly Ser Phe Gly Pro Lys Glu Asp Asp Tyr Phe Phe 
1580 1585 1590 



Lys Ala Thr Glu lie Ala Lys Lys Leu Gly Leu Pro Arg lie Tyr 
1595 1600 1605 



Leu ser Ala Asn Ser Gly Ala Arg Leu Gly lie Ala Glu Glu Leu 
1610 1615 ~ 1620 



Leu His lie Phe Lys Ala Ala Phe val Asp Pro Ala Lys Pro ser 
1625 1630 1635 



Met Gly lie Lys Tyr Leu Tyr Leu Thr Pro Glu Thr Leu Ser Thr 
1640 1645 1650 



Leu Ala Lys Lys Gly ser ser val Thr Thr Glu Glu lie Glu Asp 
1655 1660 1665 



Asp Gly Glu Arg Arg His Lys lie Thr Ala lie lie Gly Leu Ala 
1670 ~ 1675 1680 



Glu Gly Leu Gly Val Glu Ser Leu Arg Gly Ser Gly Leu lie Ala 
1685 1690 1695 



Gly Ala Thr Thr Arg Ala Tyr Glu Glu Gly lie Phe Thr lie Ser 
1700 1705 1710 



Leu val Thr Ala Arg ser val Gly lie Gly Ala Tyr Leu val Arg 
1715 " 1720 1725 



Leu Gly Gin Arg Ala lie Gin Val Glu Gly Asn Pro Met lie Leu 
1730 " 1735 1740 



Thr Gly Ala Gin Ser Leu Asn Lys Val Leu Gly Arg Glu val Tyr 
1745 1750 1755 



Thr Ser Asn Leu Gin Leu Gly Gly Thr Gin lie Met Ala Arg Asn 
1760 1765 1770 



Gly Thr Thr His Leu val Ala Glu ser Asp Leu Asp Gly Ala Leu 
1775 1780 1785 



Lys val lie Gin Trp Leu Ser Tyr Val Pro Glu Arg Lys Gly Lys 
1790 1795 1800 



Ala lie Pro lie Trp Pro Ser Glu Asp Pro Trp Asp Arg Thr Val 
1805 1810 1815 



Thr Tyr Glu Pro Pro Arg Gly Pro Tyr Asp Pro Arg Trp Leu Leu 



1820 



1825 



1830 
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Glu Gly Lys Pro Asp Glu Gly Leu Thr Gly Leu Phe Asp Lys Gly 
1835 1840 " 1845 



Ser Phe Met Glu Thr Leu Gly Asp Trp Ala Lys Thr lie val Thr 
1850 1855 1860 



Gly Arg Ala Arg Leu Gly Gly lie Pro Met Gly Val lie Ala val 
1865 " 1870 1875 



Glu Thr Arg Thr Thr Glu Lys lie lie Ala Ala Asp Pro Ala Asn 
. 1880 1885 1890 



pro Ala Ala Phe Glu Gin Lys lie Met Glu Ala Gly Gin val Trp 
1895 1900 1905 



Asn Pro Asn Ala Ala Tyr Lys Thr Ala Gin Ser lie Phe Asp lie 
1910 1915 1920 



Asn Lys Glu Gly Leu Pro Leu Met lie Leu Ala Asn lie Arg Gly 
1925 1930 1935 



Phe ser Gly Gly Gin Gly Asp Met Phe Asp Ala He Leu Lys Gin 
1940 1945 1950 



Gly Ser Lys He val Asp Gly Leu Ser Asn Phe Lys Gin Pro val 
1955 1960 1965 



Phe Val Tyr val Val Pro Asn Gly Glu Leu Arg Gly Gly Ala Trp 
1970 1975 1980 



Val Val Leu Asp Pro Thr lie Asn Leu Ala Lys Met Glu Met Tyr 
1985 1990 1995 



Ala Asp Glu Thr Ala Arg Gly Gly lie Leu Glu Pro Glu Gly lie 
2000 2005 2010 



val Glu lie Lys Phe Arg Arg Asp Lys val lie Ala Thr Met Glu 
2015 2020 2025 



Arg Leu Asp Glu Thr Tyr Ala ser Leu Lys Ala Ala ser Asn Asp 
2030 2035 2040 



Ser Thr Lys ser Ala Glu Glu Arg Ala Lys ser Ala Glu Leu Leu 
2045 2050 2055 



Lys Ala Arg Glu Thr Leu Leu Gin Pro Thr Tyr Leu Gin lie Ala 
2060 2065 2070 



His Leu Tyr Ala Asp Leu His Asp Arg val Gly Arg Met Glu Ala 



2075 



2080 
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Lys Gly cys Ala Lys 
2090 



Arg Ala val Trp Ala Glu Ala Arg Arg Phe 
2095 2100 



Phe Tyr Trp Arg Leu Arg Arg Arg Leu Asn Asp Glu His lie Leu 
2105 2110 2115 



Ser Lys Phe Ala Ala Ala Asn Pro Asp Leu Thr Leu Glu Glu 
2120 2125 2130 



Gin Asn lie Leu Asp Ser val Val Gin Thr Asp Leu Thr Asp 
2135 2140 2145 



Arg Ala Thr Ala Glu Trp He Glu Gin ser Ala Glu Glu lie Ala 
2150 2155 2160 



Ala Ala Val Ala Glu Val Arg ser Thr Tyr val ser Asn Lys lie 
2165 2170 2175 



lie ser Phe Ala Glu Thr Glu Arg Ala Gly Ala Leu Gin Gly Leu 
2180 2185 2190 



val Ala val Leu Ser Thr Leu Asn Ala Glu Asp Lys Lys Ala Leu 
2195 2200 2205 



Val Ser ser Leu Gly Leu 
2210 

<210> 4 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<221> mi sc^feature 

<222> (6) . . (6) 

<223> n is a, c, g or t 

<220> 

<2 2 1> mi s c_f eat u re 

<222> (9) . . (9) 

<223> n is a, c, g or t 
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<220> 

<221> misc_feature 

<222> (15) . . (15) 

<223> n is a, c, g or t 



<220> 

<221> misc_feature 

<222> (18) . . (18) 

<223> n is a, c, g or t 

<220> 

<221> misc_feature 

<222> (21).. (21) 

<223> n is a, c, g or t 



<220> 

<221> misc_feature 

<222> (24).. (24) 

<223> n is a, c, g or t 



<400> 4 

athggngcnt ayytngynmg nytngg 26 

<210> 5 

<211> 25 

<212> DNA 

<213> Artificial 



<220> 

<221> misc_feature 

<222> (3).. (3) 

<223> n is a, c, g or t 



<220> 

<221> misc_feature 
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<222> (6).. (6) ... 

<223> n is a, c, g or t 



<220> 

<221> misc_feature 

<222> (12).. (12) 

<223> n is a, c, g or t 



<220> 

<221> misc_feature 

<222> (15).. (15) 

<223> n is a, c, g or t 



<220> 

<221> miscfeature 

<222> (18). .(18) 

<223> n is a, c, g or t 



<220> 

<221> mis cofeature 

<222> (21) . . (21) 

<223> n is a, c, g or t 



<220> 

<221> mis cofeature 

<222> (24) . . (24) 

<223> n is a, c, g or t 



<400> 5 

acnacnaccc angcnccncc nckna 25 

<210> 6 

<211> 26 

<212> DNA 

<213> Artificial 
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<400> 6 

ttaccctcgt cgtcctcaag aaccat 



26 



<210> 7 

<211> 26 

<212> DNA 

<213> Artificial 

<400> 7 

tggatcctac tatcaacctg ccaaga 26 

<210> 8 

<211> 26 

<212> DNA 

<213> Artificial 



<210> 9 

<211> 20 

<212> DNA 

<213> Artificial 

<400> 9 



<210> 10 

<211> 19 

<212> DNA 

<213> Artificial 

<400> 10 

gattagatag ggatctagt 19 



<400> 8 

gtgaacactg tcttgagagt gtcctt 



26 



ccgctgctca gcttcagatt 



20 
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